Ensemble Learning With Manifold-Based Data Splitting for Noisy Label Correction
نویسندگان
چکیده
Label noise in training data can significantly degrade a model’s generalization performance for supervised learning tasks. Here we focus on the problem that noisy labels are primarily caused by mislabeled confusing samples, which tend to be concentrated near decision boundaries rather than uniformly distributed, and whose features should equivocal. To address problem, propose an ensemble method correct exploiting local structures of feature manifolds. Different from typical strategies increase prediction diversity among sub-models via certain loss terms, our trains disjoint subsets, each being union randomly selected seed samples’ nearest-neighbors same class manifold. As result, only limited number will affected locally-concentrated labels, sub-model learn coarse representation manifold along with corresponding graph. The constructed graphs used suggest set label correction candidates, accordingly, determines results majority decisions. Our experiments real-world datasets demonstrate superiority proposed over existing state-of-the-arts.
منابع مشابه
Mining Multi-Label Data Streams Using Ensemble-Based Active Learning
Data stream classification has drawn increasing attention from the data mining community in recent years, where a large number of stream classification models were proposed. However, most existing models were merely focused on mining from single-label data streams. Mining from multi-label data streams has not been fully addressed yet. On the other hand, although some recent work touched the mul...
متن کاملMulti-Label Manifold Learning
This paper gives an attempt to explore the manifold in the label space for multi-label learning. Traditional label space is logical, where no manifold exists. In order to study the label manifold, the label space should be extended to a Euclidean space. However, the label manifold is not explicitly available from the training examples. Fortunately, according to the smoothness assumption that th...
متن کاملRobust ensemble learning for mining noisy data streams
a Institute of Computing Technology, Chinese Academy of Sciences, Beijing, 100190, China b Centre for Quantum Computation & Intelligent Systems, University of Technology Sydney, Broadway, NSW 2007, Australia c Research Center on Fictitious Economy and Data Science, Chinese Academy of Sciences, Beijing, China d College of Information Science & Technology, Univ. of Nebraska at Omaha, Omaha, NE 68...
متن کاملEnsemble-Based Discriminant Manifold Learning for Face Recognition
The locally linear embedding (LLE) algorithm can be used to discover a low-dimensional subspace from face manifolds. However, it does not mean that a good accuracy can be obtained when classifiers work under the subspace. Based on the proposed ULLELDA (Unified LLE and linear discriminant analysis) algorithm, an ensemble version of the ULLELDA (En-ULLELDA) is proposed by perturbing the neighbor ...
متن کاملIsometric Correction for Manifold Learning
In this paper, we present a method for isometric correction of manifold learning techniques. We first present an isometric nonlinear dimension reduction method. Our proposed method overcomes the issues associated with well-known isometric embedding techniques such as ISOMAP and maximum variance unfolding (MVU), i.e., computational complexity and the geodesic convexity requirement. Based on the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Multimedia
سال: 2022
ISSN: ['1520-9210', '1941-0077']
DOI: https://doi.org/10.1109/tmm.2021.3119861